Asynchronous Parallel Programming Model for SMP Clusters
نویسندگان
چکیده
Our study proposes a novel MPI-only parallel programming model with improved performance for SMP clusters. By rescheduling tasks in a typical flat MPI solution, our model forces processors of an SMP node to work in different phases, thereby avoiding unneccessary communication and computation bottlenecks. This study achieves a significant performance improvement with a minimal programming effort. In comparison with a de-facto flat MPI solution, our algorithm can yield a 21% performance improvement for a 16-node cluster of Xeon dual-processor SMPs while performing a distributed matrix multiplication.
منابع مشابه
Improving Linpack Performance on SMP Clusters with Asynchronous MPI Programming
This study proposes asynchronous MPI, a simple and effective parallel programming model for SMP clusters, to reimplement the High PerformanceLinpack benchmark. The proposed model forces processors of an SMP node to work in different phases, thereby avoiding unneccessary communication and computation bottlenecks. As a result, we can achieve significant improvements in performance with a minimal ...
متن کاملTask Pool Teams: a hybrid programming environment for irregular algorithms on SMP clusters
Clusters of SMPs (symmetric multiprocessors) are popular platforms for parallel programming since they provide large computational power for a reasonable price. For irregular application programs with dynamically changing computation and data access behavior a flexible programming model is needed to achieve efficiency. In this paper we propose Task Pool Teams as a hybrid parallel programming en...
متن کاملOvercoming performance bottlenecks in using OpenMP on SMP clusters
This paper presents a new parallel programming environment called ParADE to enable easy, portable, and high-performance computing for SMP clusters. Different from the prior studies, ParADE separates the programming model from the execution model: it enables shared-address-space programming while it realizes hybrid execution of message-passing and shared-address-space. To overcome the poor perfo...
متن کاملComparison of Parallel Programming Models on Clusters of SMP Nodes
Most HPC systems are clusters of shared memory nodes. Parallel programming must combine the distributed memory parallelization on the node interconnect with the shared memory parallelization inside of each node. Various hybrid MPI+OpenMP programming models are compared with pure MPI. Benchmark results of several platforms are presented. This paper analyzes the strength and weakness of several p...
متن کاملA Taxonomy of Programming Models for Symmetric Multiprocessors and SMP Clusters
The basic processing element, from PCs to large systems, is rapidly becoming a symmetric multiprocessor (SMP). As a result, the nodes of a parallel computer will often be an SMP. The resulting mixed hardware models (combining shared-memory and distributed memory) provide a challenge to system software developers to provide users with programming models that are portable, understandable, and eff...
متن کامل